Minimal work principle: proof and counterexamples 
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The minimal work principle states that work done on a thermally isolated equilibrium system 
is minimal for adiabatically slow (reversible) realization of a given process. This principle, one 
of the formulations of the second law, is studied here for finite (possibly large) quantum systems 
interacting with macroscopic sources of work. It is shown to be valid as long as the adiabatic energy 
levels do not cross. If level crossing does occur, counter examples are discussed, showing that the 
minimal work principle can be violated and that optimal processes are neither adiabatically slow 
nor reversible. The results are corroborated by an exactly solvable model. 
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I. INTRODUCTION 

The second law of thermodynamics Q, 0, IE H , formu- 
lated nearly one and half century ago, continues to be 
under scrutiny IE 13 • While its status within equi- 
librium thermodynamics and statistical physics is by now 
well-settled 0, 0, IE 13 j its fate in various border situa- 
tions is far from being clear [E IE IE IE] • I n the macro- 
scopic realm the second law is a set of equivalent state- 
ments concerning quantities such as entropy, heat, work, 
etc. In more general situations these statements need not 
be equivalent and some, e.g. those involving entropy, may 
have only a limited applicability |E 13 ■ In contrast to en- 
tropy, the concept of work has a well-defined operational 
meaning for an arbitrary system interacting with macro- 
scopic work sources 0,12, EI- Moreover, the definition of 
work is not confined to (nearly) equilibrium situations: 
it is defined equally well for processes that start and end 
in arbitrary non-equilibrium states Q- [3- 0] ■ 

In this perspective the second law can be regarded as 
a set of several statements concerning the work. They 
were first derived from observations and have direct coun- 
terparts in everyday experience. Two formulations of 
the second law are at the focus of our present interest 
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• Thomson's formulation: no work can be extracted 
from an equilibrium system by means of a cyclic 
process generated by an external work source. 

• The minimal work principle: when varying the 
speed of a given process done on an (initially) equi- 
librium system, the work is minimal for the slowest 
realization of the process (more details of the prin- 
ciple are given below). 

These two statements are well defined — both opera- 
tionally and conceptually — for a finite system coupled 
to macroscopic sources of work. It is therefore interesting 
to consider whether they are valid in this domain. This is 
an inquiry towards one of the most pertinent questions in 
the foundations of statistical physics: whether the second 
law is valid for finite systems, or, put differently, whether 



the thermodynamical limit is really necessary for the va- 
lidity of the second law. Some historical developments of 
this question are recalled in the Appendix. 

Thomson's formulation was derived with great rigor 
from the principles of quantum mechanics |3, llfl IllL Il2| . 
It is valid not only for a macroscopic, but also for a finite 
system coupled to sources of work. This fact is not oc- 
casional: would Thomson's formulation be violated for a 
finite (e.g., few-level) system, it would immediately im- 
ply violations for large collections of such systems, i.e., 
for some macroscopic systems. 

There are also several pertinent derivations of the min- 
imal work principle. They cover some important but still 
particular cases, such as the linear response limit (weak 
coupling to the work source) 0,0, quasi-slow processes 
fl7| , and constant temperature processes for macroscopic 
systems [n], [lj, U3, EH ■ So far all these derivations were 
confirming the validity of the minimal work principle. 

Our present purpose is to study the minimal work prin- 
ciple for a finite (possibly large) quantum system coupled 
to sources of work. We derive the principle for finite sys- 
tems and then discuss its limits. It is found that — in 
contrast to Thomson's formulation — the domain of the 
validity of the minimal work principle is large but def- 
initely limited. These limits are connected with cross- 
ing of adiabatic energy levels, and they are illustrated 
via counterexamples which include an exactly solvable 
model. A more detailed discussion on what, to our opin- 
ion, qualifies as a limit of the second law, is given in the 
Appendix. 

The phenomenon of level-crossing (conical intersec- 
tions), which was once little more than a theoretical 
curiosity, attracted recently much attention in chemical 
physics, quantum chemistry and biophysics; see Ref. pof 
for a review. Its importance was recognized for such es- 
sential processes as charge transfer reactions, light har- 
vesting and ultrafast decay of excited states. We thus 
show here that the same phenomenon of level crossing is 
crucial for the proper understanding of the second law. 

The paper is organized as follows. In section [H] we re- 
call the basic setup of the problem. Section ITTll discusses 
the minimal work principle and its various implications. 
Then the principle is derived for finite systems. Section 
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IVl studies the limits of the principle, while section El dis- 
cusses relations of the principle to Thomson's formula- 
tion of the second law and to cyclic processes. The last 
section presents our conclusions. 

II. THE SETUP. 

A. Thermally isolated processes. 

Consider a quantum system S which is thermally iso- 
lated 0,0,0] : it moves according to its own dynamics and 
interacts with an external macroscopic work source. This 
interaction is realized via time-dependence of some pa- 
rameters R(t) = {Ri(t),R2(t), ...} of the system's Hamil- 
tonian H(t) = H{R(t)}. They move along a certain tra- 
jectory R(t) which at some initial time U starts from 
Ri = R(U), and ends at R{ = R(t[). The initial and final 
values of the Hamiltonian are 

Hi = H{Ri} and H { = H{R f }, (1) 

respectively. 

The Hamiltonian H(t) generates a unitary evolution: 

ih~ P (t) = [H(t),p(t)}, P (t) = uWpMrt®, (2) 

with time-ordered 

U(t)=&v[~£d8H(s)]. (3) 

The interaction of S with the source is related with 
flows of work which qualifies as a high-graded (ordered 
or mechanical) type of energy. The work W done on S 
reads [U, 0, H, |j 

W = jT dttr W)H{t)] = tr[flfp(tf)] - ixlHipfa)], (4) 

where we performed partial integration and inserted JSJ. 
This is the average energy increase of S, which, due to 
energy conservation, coincides with the average energy 
decrease of the source Q, 0] ■ Thus there are two ways to 
measure work. One can let the work-source to interact 
with an ensemble of systems S and then to measure the 
energy change of the work-source. Due to the macro- 
scopic size of the latter, its energy practically coincides 
with its average 0,0- Or, alternatively, one can measure 
directly the initial and final average energies of S, e.g., 
via measuring the Hamiltonians Hi and Hf, respectively, 
on the ensemble of systems S. 

B. Initial state. 

Initially the system S is assumed to be in equilibrium 
at temperature T = 1/(3 > 0, that is, S is described by a 



gibbsian density operator: 

p(U) = i expt-fiHi), Zi = txe- 0H \ (5) 

This equilibrium state can be prepared by a weak in- 
teraction between S and a macroscopic thermal bath at 
temperature T 0, , and then decoupling S from 
the bath in order to achieve a thermally isolated process 
HI 0> 0] • The fact of preparing S via a thermal bath 
has the following implications: 

1. Reproducibility: The preparation process leading 
to p(ti) can in principle be repeated the needed 
amount of times, and thus various effects displayed 
by S and described by density matrix p can be am- 
plified to the needed extent. Moreover, this repro- 
duction is in principle not connected with energy 
costs: due to the weak-coupling to the bath no (or 
little) energy should be spent for switching the cou- 
pling on and off. 

2. According to quantum mechanics, p{U) allows to 
predict probabilities for various results obtained via 
measurements done on S. In this sense p{t\) can 
be thought to describe an ensemble of identically 
prepared systems S, rather than one preparation 
of a single system. Since it was prepared via a 
thermal bath only — i.e., without additional mea- 
surements employed to separate the ensemble into 
sub-ensembles — no physical interpretation should 
be based on a choice of sub-ensembles for p(U). 
In particular, it will be incorrect to interpret p(ti) 
as " S is with some probability in a state with a 
definite but unknown energy (eigenvalue of Hi)". 
This interpretation is allowed for a classical Gibbs 
distribution, but in the quantum situation it may 
lead to inconsistencies with experiment; see, e.g., in 
Ref. (3l| . One of the reasons prohibiting such an in- 
terpretation is that separation of a mixed-state en- 
semble into pure-state components (sub-ensembles) 
depends on the concrete measurement done for this 
purpose and therefore is not unique. 

III. THE MINIMAL WORK PRINCIPLE. 
A. Formulation of the principle. 

Let S start in the state (J5J, and let the parameters 
R(t) move between Ri and Ri along a trajectory R(t). 
The work done on S during this process is W. Consider 
the adiabatically slow realization of this process: R pro- 
ceeds between the same values Ri and Rf and along the 
same trajectory, but now with a homogeneously vanish- 
ing speed, thereby taking a very long time t{ — ti, at the 
cost of an amount work W. The minimal- work principle 
then asserts Q, 0, 

W > W. (6) 
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This is a statement on optimality: if work has to be ex- 
tracted from S, is negative, and to make it as negative 
as possible one proceeds with a very slow speed. If dur- 
ing some operation work has to be added (W > 0) to S, 
one wishes to minimize its amount, and again operates 
slowly. 

The following remarks intend to clarify the physical 
meaning of the principle and to prevent its improper use. 

a. For thermally isolated systems, adiabatically 
slow processes are reversible. This is standard if S is 
macroscopic 0,00; an d below in section IVTI it is shown 
to hold for a finite S as well, where the definition of re- 
versibility extends unambiguously (i.e., without invoking 
entropy) 0. Thus, the minimal work principle states 
that the optimal thermally isolated processes are re- 
versible. 

b. The formulation of the minimal work principle 
does not by itself give any detailed information on the 
precise meaning of being "slow" . Only its derivation 
from first principles can be informative in this respect, 
and this is one of the reasons why such derivations are 
really needed. It suffices to say at the present moment 
that "slow" means "slower than the characteristic times 
of the system relevant for calculating work" , and cer- 
tainly does not mean "slower than characteristic times 
of the system S without any interaction with external 
sources of work" . Indeed, we shall see below that in gen- 
eral the relevant characteristic times become determined 
only once the very process R(t) — with its initial and 
final points i?i and R{ — is given. The operational def- 
inition of slow processes in the context of the minimal 
work principle is straightforward: one increases the char- 
acteristic time-scale r of the process till the work as a 
function of r saturates, i.e., it does not anymore depend 
on r. It is conceivable that for some systems there will 
be several type of slow processes, that is, the work as a 
function of r will display several plateaus. If so, then 
each typ e of slow processes can be studied for its own 
sake git EE3- 

c. Note that as far as the work — i.e., average en- 
ergy lost by the work source — is concerned, any processes 
can be considered as a part of a thermally isolated one, 
provided S is defined to be the whole system which in- 
teracts with the source of work and which together with 
the latter forms a closed system 1 (see additionally in this 
context Eq. i|18|) and the remark after it). Thus, the 
statement of the minimal work principle is more general 
than it may seem. 



This is akin to the known statement that any dynamical evo- 
lution in quantum mechanics can be viewed as a part of a uni- 
tary evolution, provided all the environment of the system was 
included into the description. Such statements are sometimes 
regarded as too general and therefore useless. However, they 
can be very useful as instanced by the formalism of completely 
positive operations for open quantum systems 0|- 



d. Finally, it may be useful to comment on the used 
nomenclature. Following Ref. pj we call a slow thermally 
isolated process adiabatic. Now and then for sharpening 
the needed context we use equivalently the term "adia- 
batically slow" . Note that this by no means presupposes 
validity of the quantum-mechanical adiabatic theorem. 

B. The minimal work principle for macroscopic 
systems. 

In macroscopic thermodynamics the minimal work 
principle is derived 0, 0] from certain axioms which en- 
sure that, within the domain of their applicability, this 
principle is equivalent to other formulations of the second 
law. Derivations in the context of statistical thermody- 
namics are presented in Refs. H El IH 13 IH 13 . 

In the following discussion we will reproduce a proof of 
the minimal work principle for a class of thermally iso- 
lated processes realized on a macroscopic system. Our 
purpose is to understand why precisely the principle 
holds in this situation, and thereby to motivate its in- 
vestigation for finite systems. 

The derivation proceeds in two steps. First one con- 
siders the relative entropy 

S[p(t f )\\p e(l (H { )] = tr[p(t f ) In p(U) - p{tt) In p eq (Jf f )}, (7) 

between the final state p(U) given by 10 and an equilib- 
rium state 

Pcq (H f ) = exp(-pH t )/Z u Z { = tte-P H ', (8) 

which corresponds to the final Hamiltonian Hf and to the 
same temperature T — 1/(3 as in the initial state p(U). 

The relative entropy S , [p||cr] is known to be non- 
negative for any density matrices p and a. Among other 
useful features, it can serve as a (non-symmetric) "dis- 
tance" between p and a, since S^pHer] = implies p = a. 
Applications of relative entropy in statistical physics and 
quantum information theory are reviewed in Refs. 

As follows from the unitarity of the evolution operator 
©, 

Si = -tr[p(t t ) In p(t t )] = -tr[p(ti)lnp(ti)] = Si. (9) 

This is the well-known conservation of Von Neumann en- 
tropy during a thermally isolated process. 

The definition of relative entropy JJJ), combined with 
Eqs. 0IH1II3) and with the definition of work (0J brings: 

S[p(t { )\\p etl (H { )} = tr[p(«0 \np(U) - p{U) \np cq (H t )} 

= f3W + \nZ i -\nZ- l . (10) 

Using the non-negativity of the relative entropy one 
gets 

W= F(Ht)-F{H{)+TS\p{ti)\\p«£.H t )] (11) 
> F{H { ) - F(Hi) (12) 
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where F(H) is the free energies corresponding to Hamil- 
tonian H and temperature T: 

F{H) = -T\ntve- 0H . (13) 

This inequality is well-known 0, (though is derived 
and formulated less explicitly) and sometimes is viewed 
as a proof of the minimal work principle 2 . However, 
we still have to show that the work W in the slow limit 
coincides with the difference of free energies given by the 
right hand side (RHS) of Eq. (JT2J. Recall that the latter 
quantity was so far defined only formally. 

Eq. l|T^|l gets the needed physical meaning when one 
assumes that for a macroscopic system S, the final state 
p(tf) — reached from p(t{) = exp(— fiH\)jZ\ by the adia- 
batically slow process — can be approximated by p oq (H{) 
defined in Eq. ©: 

S\p(tt)\\P«i(H f )]KO, (14) 

where ~ means that the equality is supposed to recover in 
the macroscopic (thermodynamic) limit for S. This then 
leads from Eq. 111( 1 to the needed relation 

W « F(H f ) - F{Hi). (15) 

The statement of the minimal work principle then 
follows from Eqs. PIT51). 

In the second step of the derivation one should prove 
Eq. (|15fl . There are several classes of macroscopic sys- 
tems for which one can show that the free energy dif- 
ference F(Hf) — F(Hj) in i|12|) indeed coincides with the 
adiabatic work 0, fill Hsj . We recall below one of them. 

Assume that S consists of two parts: macroscopic ther- 
mal bath B and a subsystem (particle) P coupled to it: 

H{t)=H P {R(t)} + H B +gH l , (16) 

where Hp (t) and Hp, stand for the Hamiltonians of P and 
B, respectively, and where H\ is the interaction Hamilto- 
nian with g being the corresponding coupling constant. 
The source of work interacts with the particle only, thus 
only Hp{R(t)} is time-dependent. 

The following conditions are usually considered to be 
sufficient for the validity of Eq. (|15f) ; see Q, ■ Along 
the lines of Ref. 8] , we present them in a slightly more 
formalized and particular way, and we recall that they 
were checked in models pi, Il8j. 

• The thermal bath B is composed of a macroscopic 
number of harmonic oscillators with a proper (e.g. 
ohmic) spectrum of their interaction with P. 



2 Recently inequality 1121 was generalized within so called work- 
fluctuations theorems ll2llT9ll . These theorems account for fluc- 
tuations which appear when measuring the work via the system's 
energy. 



• The evolution generated by Hit) in i|16|l starts 
from an equilibrium state (JHJ) for the total system 
S=B+P. 

• The characteristic time of the external process R(t) 
is assumed to be much larger than the relevant 
times of the particle P. These times include those 
generated by the Hamiltonian Hp{R(t)}, as well as 
the relaxation times of P induced by the bath. The 
latter times are controlled by the interaction Hamil- 
tonian Hi and they become very long for g — ► 0. 
Note that since the thermal bath B is assumed to 
be macroscopic (dense spectrum), there are char- 
acteristic times of B (so called Heisenberg times) 
which are proportional to inverse level-spacing and 
are thus very large. They, however, do not enter 
into the definition of adiabatically slow processes. 
All the relevant characteristic times are finite in the 
thcrmodynamical limit for the bath, at least for the 
type of models considered in [a] . 

The above three conditions are sufficient for the density 
operator of P to be given as Q 

flp(t) = ^tr B e-«> Z(t) = tre-^M (17) 

where trB(p) means trace over the bath (particle) degrees 
of freedom. The physical meaning of (|17fl is obvious: 
in the slow limit P is in the local-equilibrium (or local- 
stationary) state 3 . 

Note that the state of P at time t need not at all co- 
incide with the local Gibbsian e _/3Hp(t) /Z F (t), since no 
weak-coupling assumption on the system-bath interac- 
tion was made. 

Eqs. l(H 1161 n"7)l suffice to derive relation l|15fl between 
the work W done for the adiabatically slow process and 
the difference F(H[) — F(Hi) of free energies. Indeed, 
note from Eq. that d t H(t) = d t H P (t) and gj) can 
be written as 

W = / dttrp[p P (i) d t Hp{t)]. (18) 

This relation shows that the work defined globally via 
the energy difference of the overall system of the particle 
and the bath can also be calculated via integration of the 



3 For the actual calculation in Eq. 1171 . one may need to keep the 
bath large but finite, to carry out the trace tre, and only then 
to go to the macroscopic limit for the bath. Then the density 
matrix pp(t) is finite and well-behaved although the quantities 
like Z(t) = tr e~^ H ^ may not be well-defined in the macroscopic 
limit for the bath. This is a standard procedure in statistical 
physics and it is legitimate for the present situation, since the 
thermodynamical limit for the bath commutes with the limit of 
slow processes, as we already recalled above. The validity of such 
commutation was also seen in Ref. Il7l. 
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expression in the RHS of Eq. I|18|) . which contains only 
quantities referring to the particle P. 

Now proceed with application of Eq. i(T7|) : 



W = 



dt 

u W) 
tf dt 



trp trp 



z{t) 
tf 



tr 



e -mt) d tH (t) 



= -T jf 'di^ lnZ(t) = F(H{)-F(Hi). (19) 

Thus the minimal work principle (jSJ) is proved for this 
class of processes realized on macroscopic systems. One 
may note once again that although the quantities F(H{) 
and F(Hi) are very large in the macroscopic limit for the 
bath, their difference is finite and is order of the particle's 
energy. This is also seen from (|18f) which contains only 
quantities referring to the particle. 



IV. 



FINITE SYSTEMS. 



Let us now turn to a finite TV- level quantum system S. 4 
The first thing to do is to apply here the reasoning devel- 
oped above for macroscopic systems. In fact, Thomson's 
formulation of the second law, W > 0, is valid for finite 
systems for precisely the same reasons as it is valid for 
macroscopic ones. Indeed, since this formulation refers 
to cyclic processes, one puts Hr = Hi in Eqs.|0 IT2"j) - 
this can be done without altering the validity of (jT2|l . 
since p eq is in general an auxiliary density matrix — and 
gets W > for cyclic processes, which is the statement 
of Thomson's formulation. 

However, when trying to apply the above reasoning 
for finite systems, one immediately sees that there are 
no reasons why the work in the slow limit should be 
equal to the difference in free energies. Neither are there 
reasons to expect validity of Eq. (|14[1 . that is, to ex- 
pect that in general the final state p(t{) reached from 
p(U) = exp(—f3Hi)/Zi during the adiabatically slow pro- 
cess, should be equal to p cq (H{) defined by JHJl. The 
reason for this can be seen by noting that due to the uni- 
tarity of the evolution operator U in @, the spectrum 
of p(tf ) coincides with that of the initial density matrix 
p(t[), and in general does not have the Gibbsian shape of 
the spectrum of p cq (H{). 



4 Note that an attempt was made rec ently to study the minimal 
work principle for finite systems ll-J . However, the result of this 
study is to our opinion incomplete, since the author of Ref. ll.J 
obtained that the principle has the same range of validity as 
Thomson's formulation of the second law. This came from in- 
correct treatment of adiabatically slow pro cesses. It is however 
to be mentioned that the author of Ref. 13j has stressed the pre- 
liminary character of his results, and that his results concerning 
cyclic processes are correct. 



The facts that p(t t ) ^ p eq (H f ) and W ^ F(H f )-F(Hi) 
for finite and certain macroscopic systems are known 
and were studied especially in the linear response regime 
[Hill HI HI El IH Here are some results obtained 
within that activity: i) A class of finite systems was de- 
termined for which the adiabatic work coincides with the 
difference in free energies 113, 13- ii) Conditions were de- 
termined for macroscopic systems, under which the adi- 
abatic work converges to the free energy difference in 
the thermodynamical limit 0, HE H3- Concrete esti- 
mates for the rate of convergence were given. Hi) It was 
found that for certain non-trivial macroscopic systems 
the above convergence can be absent even in the ther- 
modynamical limit [3> • It was argued that quantum 
systems are more vulnerable in this respect than classi- 
cal ones [24| . iv) The fact that p(t{) ^ p cq (Hf) was re- 
cently applied for studying certain processes which are re- 
versible in the macroscopic limit, but become irreversible 
for finite system |28| . 

Thus we cannot rely on macroscopic analogies and we 
need an independent derivation of the minimal work prin- 
ciple © for finite systems. Some ideas of the following 
derivation were adopted from Ref. [ljj- Let the spectral 
resolution of H(t) and p(ti) be 5 



JV 



H(t) = £ k(t)\k, t) (k, t\, (k, t\n, t) = S kn , (20) 

e -l3e k (ti) 



k=l 
N 



= $^Pk|Mi)(Mi|) 



Pa- 



fe=i 



En e-^»(t0- 



At t = ti we order the spectrum as 

ei(*i) < •■• < ejv(ii) 
Pi > ■■■ > Pn- 



(21) 



(22) 
(23) 



For any t in the interval t\ < t < tf we expand over the 
complete set |n, t): 

N 

U(t)\k,t i )=J2^n(t)e'^^ dt ' £ " {t ' ) \n,t), (24) 



n=l 



where 



Ofcn(t) = a kn (t;U) = (n,t\ U{t)\k,U) e» ^ dt ' £ " (t,) , 



(25) 



are the expansion coefficients, and where |afc„(tf)| 2 is the 
probability to measure energy £ n (t{) at the final time tf, 
provided the initial state was \k, ti){k, tj|. 



5 The eigenstates of p(t\) are used below as a calculational tool. It 
is by no means implied that "what really happens" is that the 
system S is — with some probability — in one of those states. See 
also the second remark in section fll Bl 
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Now we use Eqs. (|2"T1I24T) to obtain for the work (@J: 



JV 



JV 



W= J2 \akn(tf)\ 2 p k e n (tf)~J2Pk £ k(ti). (26) 



k.n — 1 



fe=l 



A similar formula can be derived to express the adiabatic 
work W in terms of coefficients afc n (tf): 



JV 



JV 



W= l«fc«(*f)| 2 'Pk£n{ti) -^p fc £fe(ti). (27) 



k,n— 1 /c— 1 

From the definition 1)25(1 , 

k„(t f )| 2 = |(M f |[/|Mi)| 2 , 

it follows that 6 

JV N 

^K„(if)| 2 = ^k„(tf)| 2 = i. 

fc=l n=l 

Employing the identity (summation by parts): 



(28) 



(29) 



JV JV-l 



/ J £ n X n = £N 22 X n - 2J [£ m +l - £m] ^n, (30) 



with x Tl 

> JV 



n— 1 m— 1 



SaLi |afen(^f)| 2 Pfc and 



|afen(if)| 2 Pfc, we obtain from Eqs. (23 l2"7|l 
and using Eqs. 1(29(1 a general formula for the difference 
between the non-adiabatic and adiabatic: work: 



JV-l 



w-w = ^[£ ro+ i(tf)-e m (tf)]e m , (31) 

m— 1 

m iV 

6 m = &*( l«fe"(*f)| 2 - kn(*f)| 2 ). (32) 



ra=l fc=l 



Let us now assume that the ordering ((22(1 is kept at 
the final time t = tf : 



ei(tf) < - <ejv(*f). 



(33) 



If different energy levels did not cross each other — i.e., 
different ones do not become equal and equal ones do 
not become different — Eq. 1(33(1 is implied by the initial 
ordering (J22J- 

The behavior of energy levels with respect to level- 
crossing is governed by the non-crossing rule, which was 
numerously discussed in literature and derived in a rather 
general situation j32, OS |3(j . We shall need it in the 
following particular formulation: 



6 As possible physical interpretation of feature 1291 . note that 
for the uniform distribution of the initial states, p(k, tf) = 
i, the prediction probability p(n; tf \ k, i;) = |afe n (if)| 2 
is equal to the retrodiction probability p(k, ti\n;tf) = 

p(fc.tj) p(n;t t | MO _ W P(";tf I Mi) 



£fc =1 P(Mi)p(";tf I Mi) ££Li ip(n;i f |Mi) 



: p(n; tf | k, tf). 



If H{R] is real and only one of its parameters is 
varied with time, 1(33(1 is satisfied for any discrete- 
level quantum system: level-crossing, even if it 
happens in model-dependent calculations or due 
to approximate symmetry, does not survive arbi- 
trary small perturbation where it is substituted 
by avoided crossing (for a more general H{R} the 
conditions prohibiting level-crossing are more re- 
strictive; see |35|)- To get a stable point of level- 
crossing, one needs at least two independently vary- 
ing parameters for a real H{R}. 



The rule is known since the early days of quantum 
mechanics [3^ and is presented in textbooks; see, e.g., 
Ref. |33j . It is an important tool in atomic and molecular 
spectroscopy, and development of these fields led people 
to reconsider its derivation. In particular, Ref. [34( cor- 
rectly criticizes the standard proof of the no-crossing rule 
for being not precise and not general enough. In response 
to this and several related criticisms, (at least) two com- 
plete and general derivations appeared which settled the 
issue •'>"). [3jj . 

No level-crossings and natural conditions of smooth- 
ness of H (t) are sufficient for the standard quantum adi- 
abatic theorem [37| to ensure 



Ofcn(if) — Skn- 



(34) 



This is the known statement on the absence of transitions 
for slow variations 7 . 



7 This standard statement of the adiabatic theorem was elabo- 
rated in literature several times so a s to provide information on 
the internal characteristic times T l4j . To be slow then will 
mean t»T. Dealing with a discrete spectrum and assuming, 
as we did above, that there are no level-crossings and that adia- 
batic energy levels are smooth functions of time, T can roughly 
be estimated via the i nver se of the minimal spacing between the 
involved energy levels |42| . In this context, the levels which pro- 
vide a small spacing are said to define avoided crossing. One 
should however keep in mind that this estimation cannot be ex- 
trapolated to situations with level crossing, where the minimal 
level-spacing is zero. In those cases the whole situation changes 
and the characteristic i nter nal times T can well be finite; see 
below and in Refs. |4(1 l4ll (compare also with our discussion 
in section IIII At . The same is the case for some situations in 
a continuous spectrum, where again the minimal level-spacing 
is ze ro, but the relevant characteristic times can be finite; see, 
e.g., l43t l44l . In short: the inverse of the minimal level-spacing 
defines the relevant characteristic time only for some particular 
cases, not in general. 
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Combined with Eqs. J^IIE}; the definition brings 



Pk 



k=l 



1 - ^ \a,kn(t{ 



n=l 



N 



> 



N 



Pk\a kn (ti)\ 2 

n—1 k— m+1 

mm i 

-Pm l a fc"(*f)| 2 + XI X! \ a kn(tf)\ 

_k—l n—1 n—1 fc— m+1 

+p rn m = p m (m - m) = 0. 



(35) 



Once O m are non-negative, the statement W > W of the 
minimal work principle follows from Eqs. (|26l 1311 l3*3*j) . 

Recall once again the basic ingredients of the proof: 

• The condition i|29|l which came from the unitarity 
of the evolution during a thermally isolated process. 

• The no-crossing assumption which lead to Eq. i|33|) 
and simultaneously — via the standard adiabatic 
theorem — to Eq. (|3l1) . 

• We did not assumed that the adiabatic work is ci- 
ther related or equal to the difference in free ener- 
gies. 

• Only two features of the initial state p{t{) were 
used: [H- u p{ti)\ — determined the specific form 
(pfl 127) 1 of the work, while Eq. wa s used in 
proving TO > in Eq. (j^. 

As an immediate application of the obtained results 
note that if only one parameter is varied, we are ensured 
of the absence of level crossings and the minimal work 
principle is valid. Among many examples of this situa- 
tion there is the case of a gas — which may consist of any 
number of particles — in a rectangular container inter- 
acting with one of its walls moving in time (the standard 
setup for a one-parameter thermally isolated process). 



V. LEVEL CROSSING. 

The above non-crossing condition raises the question: 
Is the minimal work principle also valid if the adiabatic 
energy levels cross? Before addressing this question in 
detail, let us mention some popular misconceptions which 
surround the level-crossing problem: 

- The no-crossing rule is said to exclude all cross- 
ings. This is incorrect as the exclusion concerns situa- 
tions where, in particular, only one independent param- 
eter of a real Hamiltonian H{R] is varied |35j. Two 
parameters can produce robust level-crossing for such 
Hamiltonians. 

— One accepts that level-crossing can happen, but be- 
lieves it to be very rare and thus irrelevant for any sensi- 
ble physical situation. This opinion is invalidated by the 



whole chapters of chemical physics |2JJ, |21j : level cross- 
ing is not only observed in many-(at least two)-atom 
molecules, where internuclear distances play the role of 
classical time-dependent parameters, but is necessary for 
the proper description of some known chemical reactions, 
as well as for predicting new ones. Over the years sev- 
eral methods were developed for identifying and locating 
the points of level-crossing. 8 In particular, the method 
based on the geometrical phase ("Berry phase") allows 
to deduce the existence of level-crossing from the behav- 
ior of the system at points remote from the crossing; see 
Refs. |2(J for more information. 

Note as well that level-crossing is a more frequent phe- 
nomenon than avoided crossing |2^. if the number of 
independently varying parameters is larger than two. 

— It is believed that once levels can cross, Ae — ► 0, the 
very point of defining slow processes disappears as the 
internal characteristic time h/ Ae of S is infinite. This 
view misidentifies the proper internal time as seen be- 
low (see also the discussion in section IIII Al and Foot- 
note [7J . The absence of level-crossings is indeed a suffi- 
cient condition for the validity of Eq. I|34|) (no transitions 
for slow changes). It is however by no means necessary 

Emm. 

- It is sometimes believed that crossing is automat- 
ically followed by a population inversion. We shall find 
no support for that. 



A. A two-level example within adiabatic 
perturbation theory. 

As a first example we consider a spin-1/2 particle with 
Hamiltonian 



H(s) = hx(s)ai - h 3 (s)a 3 

-h 3 (s) hx(s) 
hi(s) h 3 (s) 



t/r, 



(36) 
(37) 



where a\, a 3 and 02 = i&i&3 are Pauli matrices, and 
where s is the reduced time with r being the character- 
istic time-scale. The magnetic fields h\ and h 3 smoothly 
vary in time. 

Let us now make the following assumptions i) for s — > 
Si < and for s — » Sf > 0, h\(s) and h 3 (s) go to constant 
values sufficiently fast, ii) At s = t = both hi(s) 
and h 3 (s) turn to zero. This conditions indicates a level 
crossing, since crossed eigenvalues of the traceless 2x2 
matrix H(s) means that this matrix is equal to zero at the 
point of the crossing. Besides these two basic conditions, 
we shall assume few auxiliary ones, whose purpose is to 
make the discussion below more transparent. Hi) hi(s) 



8 This is an important issue, since some numerical or approximate 
analytical methods may easily miss points of level crossing or 
may produce spurious ones. 
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and h 3 (s) are non-zero for all s, s; < s < Sf, except 
s = 0. Due to the latter condition is finite for 

the involved s, except possibly s — > 0. If it infinite in this 
limit, then obviously its inverse j^fj is finite, now for all 
5, Si < s < Sf, and goes to zero for s — > 0. Since one 
of these ratios has to be finite, we shall assume that this 
finite ratio is ^*r< . /i 3 (s) > for s < and h 3 (s) < 
for s > 0. 

Here is a concrete example realizing the last two as- 
sumptions. For s — ► 0: 



ais 



-a 3 s, 



(38) 



where ai and a 3 are positive constants. 

With the above conditions on ft-i(s) and h 3 (s), one can 
propose a useful parametrization of the Hamiltonian (|37ll . 
Recalling that /i3(s) changes its sign at s = 0, Eq. i|3*7|) 
is written as 



H(s) = Sg (s)Jht(s) + hi(s) ' COS0{S) S[U9{S 



sin 9(s) - cos 0(s) / ' 
(39) 



where 



sin#(s) = sg(s) 



/ii(a) 



cos 9(s) = — sg(s) 



h 3 (s) 



(40) 



(41) 



Here ^C~. is defined to be always positive, sg(s) is the 
sign function and 



9(s) = — arctan 



hi(s) 



h 3 (s) 



(42) 



is a parameter in the interval — tt/2 < 9(s) < ir/2. Note 
that the presence of sg(s) in the above expressions is nec- 
essary for having a smooth parametrization. Otherwise, 
e.g., h 3 (s) / ' 'h 3 (s) + h\{s) is discontinuous at the cross- 
ing point s = 0. 

The eigenvalues and eigenvectors of H(s), 

H(s)\k,s)=e k (s)\k,s), fc = l,2, (43) 

are now read off by inspecting Eq. I|39|) : 

£l(s) =Sg(s)y / / i 2(s) + / l 2( S ), E 2 ( S ) = -£i( S ), (44) 




12, s) 



sin |^(s) 



cos k0(s) 



.(45) 



It is seen that the eigenvalues £i(s) and £ 2 (s) of H(s) 
(adiabatic energy levels) cross at s = 9(s) = 0. Note 
from the above conditions on h\(s) and ft.2(s) and from 
Eqs. I|42l I44|l that both the eigenvalues and the eigen- 
vectors of H(s) are smooth functions of s. This fact is 



important for subsequent calculations, but first of all it 
is necessary for the very definition of level-crossing. 

Now Eq. 1)22(1 for the ordering of initial energy levels is 
valid, but Eq. I133H for the same ordering of final energy 
levels is not valid due to the level-crossing at s = 0. 
Eqs. (|2 ^1 |P |l imply: 



W-W=-2^/h 2 1 (s i )+h 2 3 (s i )G 1 , rs t = t u (46) 

where 0i is defined by (|32|l . 

Our strategy is now to confirm relation (|34|l in the slow 
limit r — > oo and thus to confirm that ©i > 0, implying 
that the minimal work principle is indeed violated. 

To this end we apply the standard quantum mechanical 
adiabatic perturbation theory [3]} . Substituting Eq. I|24l) 
into Eq. (J2J) one has: 



N 
m— 1 



i/*dt'Mt')-e m (t')] 



(n, s\d t \m, s). 

(47) 



As |l,s) and |2,s) in Eq. (|45|1 are real, the normal- 
ization (n,s\n,s) = 1 implies (n, s\dt\n, s) = 0. Since 
(n, s\d t \m, s) = ±{n, s\d s \m, s) the RHS of (gTJ con- 
tains a small parameter 1/r. It is therefore more conve- 
nient to introduce new variables: a,k n (t) = Skn + bkn(t), 
bkn(ti) — 0. To leading order in 1/t, b kn can be ne- 
glected in the RHS of l(47|l . and after changing variables 
as st = t, s't = t', one gets for <Zfc^ n (if) = bk^ n (tf): 



\ak=£n(ti)\ 2 = 
while, due to normalization, 



n\d s \k) 



\a k k(t{)\ 2 = l-J2\ a kn(tf)f 



,(48) 



(49) 



For our model described by Eqs. l(3f)H45[) . the quantity 
du[si(u) - e 2 (u)] (50) 
2 / duei(u) (51) 



has only one extremal point, at s = 0. We also have from 
/9 h v _ _ 1 fei(s)/i 3 (s)-/i 3 (s)fe;(s) 

{2,S\d.\l, 8)- — -- h 2 {s)+h 2 {s) > ( 52 ) 

where 6'(s) = d#/ds. 

For large r the integral in Eq. I|48fl can be calculated 
with use of the saddle-point method: 



|ai 2 (*f)| 2 



T 



(2,s|9 s |l,s)V^(s) + fe|(s) 
h^h'^s) + h 3 (s)h' 3 (s) 



(53) 
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Substituting Eq. J3HJ| into Eq. one gets 



|aia(*f)|- 



4ra| 



(54) 



Eqs. l(15ll53ll5i|l extend the statement of the adiabatic 
theorem (|34f) for the level-crossing situation. More gen- 
eral versions of similar adiabatic theorems can be found 
in Refs. |40L |41| . Inserting 

6i = {Vi - P2)\ai 2 {U)\ 2 >0 (55) 

in Eq. (|46[) confirms the violation of the minimal work 
principle. Eq. (|54ll also shows that for the considered 
process the role of the proper internal characteristic time 
is played by ha\/a\ rather than by h/(ei — £ 2 ). 

Let us recall once again that the violation of the min- 
imal work principle is due to common influence of the 
following factors: 

• There is a level-crossing: a more populated state 
goes to higher energies, while a less populated one 
goes to lower energies. 

• For slow processes there are no transitions between 
various energy levels: a,k n (tf) — $kn- 

• For not very slow processes there do occur transi- 
tions: ak n (t{) 7^ Skn- They cost less work. 

Note as well that when the external fields ft-i(s) and 
/i3(s) in Eq. I|36[) are such that \/h\(s) + h\(s) is a 
smooth function for all real s, there are no crossings of 
eigenvalues and I© is valid. In the example of level- 
crossing given above, one has y/h\(s) + H\{s) cx |s| for 
small s. Note that if level-crossing is absent, the tran- 
sition probability |ai2 (if ) | 2 is small as 0(e~ T ), since the 
integral in Eq. Ij48(l is determined by the extremal point 
of H5()(l which is now complex; see [42j for more details. 



B. Fast processes. 

One can calculate \ak n (ti)\ 2 yet in another limiting 
case, where the characteristic time r is very short, while 
the change in h\{t) and h^(t) is finite and there is the 
level crossing at t = 0. 

It is well-known |37j that in this limit energy changes 
can be calculated with help of the frozen initial state of 
S. For the present situation this leads from Eq. I|45|) to 

|ai 2 (*f)| 2 = |a 2 i(if)| 2 



H<Mf|2,ii)| 2 =sin 2 -[0(i f )-0(ii)], 



and thus to 



(56) 



(57) 



which is again positive. This demonstrates that even 
very fast processes (i.e., r — > 0) can be more optimal 
than slow ones. It is conceivable that violations of the 
minimal work principle are maximal for some finite r. 
This expectation is confirmed below. 



C. Exactly solvable model with level crossing. 

The above results obtained by perturbation theory will 
now be corroborated on an exactly solvable displaying 
level-crossing. 

Consider a two-level system with Hamiltonian 



H(s) = fuo 



scos 2 s sin 2s 
is sin 2s ssin 2 s 



s = -, (58) 

T 



where r is the characteristic time-scale, s is the reduced 
time, and where w is a coupling constant. The model 
belongs to the pool of exactly solvable driven two-level 
systems. We learned on it from Ref. where some 
asymptotic features of its solution were studied. 
The eigenvectors 



|2,«> = 



cos s 



of H(s) correspond to the eigenvalues 

£l(s) = flLOS, £2(5) = 0, 



(59) 



(60) 



respectively. It is seen that the energy levels cross, when 
s = t/r passes through zero. 

Eqs. I|47|) read for the present case: 



— = e 1 a 12 , 

as 

da 12 (s) 



ds 



(61) 
(62) 



These equations can be solved exactly in terms of hyper- 
geometric functions 



ail(s) = Ci-ffe 



i 




, sx 


LOT 





i 1 
2^7' 2 



2 2 
, s x 



,(63) 



Cl 



ai 2 (s) = -g e 4 s x H c 



-c 2 se~ s x F 1 



1 . sx 

LOT 



2, ,2 



1 + w r s ' x 



(64) 



where H c and F\ = \F\ are, respectively, Hermite and 
hypergeometrical functions [48( , and where 



(65) 



The integration constants C\ and c 2 are determined from 
the initial conditions an(sj) = 1 and ai 2 (si) = 0. 

In the slow limit lot 3> 1, the transition probability can 
be calculated via the first-order adiabatic perturbation 
theory result (f4*8l) 



dse- iuJTs2/2 . 



(66) 
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With help of the saddle-point method one gets for i|66[l : 
| fll2 ( Sf )| 2 = — (67) 

LOT 

in agreement with l|54|l 9 . The characteristic time for 
the slow process is seen to be 1/lo. 

Recalling Eq. J2HJ), one deduces from Eqs. (1211221 for 
the present model: 

W - W = hus{ |ai 2 (s f )| 2 tanh(~/?/iu; Si ), (68) 

To have level-crossing we take Si < 0, si > 0. Eq. lffl%)l 
then indeed predicts 

W-W<0. (69) 

It is seen that violations of the minimal work principle 
exist for Sf > 0, and they are maximal for |ai2 (sf ) | 2 — > 1. 
This is seen to be the case in Figs. 0for some r near 
t = 1. Note from Fig. ^that both the first-order pertur- 
bation theory result (|fJS|l and the saddle-point approxi- 
mation to it given by Eq. I|67l) are adequately reproducing 
|ai2(sf)| 2 for r > 10. Moreover, the first-order perturba- 
tion theory is seen to provide an upper bound for the 
exact expression and predicts the appearance of large os- 
cillations around r ~ 1. 

For r — > 0, |ai2(sf)| 2 goes to its value sin 2 (sf — Sj) pre- 
dicted by the sudden perturbation theory. This amounts 
to \ai 2 (s f )\ 2 = 0.01991 for the situations in Figs.QEl 

Thus, all the basic conclusions drawn from the adia- 
batic perturbation theory are confirmed by this exactly 
solvable model. 




time- scale 



FIG. 1: Amplitude |ai2 (fif ) | 2 versus the time-scale r for sj = 
— 1.5, Sf = 1.5 and lo = 1. Full oscillating curve: the exact 
value which can reach unity. Dotted curve: result from a 
first-order adiabatic perturbation theory. The smooth curve 
presents a saddle-point approximation 16711 . 



large but finite) and analytically varying Hamiltonian 
H(t), the transition probability between non-crossing lev- 
els is exponentially small [42], while, as we saw, it has 
power-law smallness for two crossing levels. Then one 
neglects in (|26|l the factors \ak^ n {tf}\ 2 coming from any 
non-crossed levels k and n, and the problem is reduced to 
the two-level situation. Thus already one crossing suffices 
to detect limits of the minimal work principle (provided, 
of course, that the crossed levels are sufficiently popu- 
lated initially). 

The reduction to the two-level situation takes place 
also in a macroscopic system which has few discrete lev- 
els located below a continuous spectrum and separated 
from the latter by a finite gap. It is known, see e.g. 
in Ref. [4l|. that the transitions between these discrete 
levels and the continuous part of the spectrum vanish ex- 
ponentially for quasi-adiabatic processes. These discrete 
levels thus decouple from the rest of the spectrum and 
the problem returns to an effectively two-level situation. 



VI. CYCLIC PROCESSES AND 
REVERSIBILITY. 

The above results do not imply any violation of the 
second law in Thomson's formulation [9i lill ITll ] : no work 
is extracted from S during a cyclic process, W^> 0. We 
illustrate its general proof obtained in Refs. pOTol ITU 
[T^ | in the context of the level crossing model given by 
Eqs. P>P5). 

Assume that the trajectory R(t) — (hx{t),h2(t)) de- 
scribed there is supplemented by another trajectory R'(t) 
which brings the parameters back to their initial values 
(hi(ti), hs(ti) ) so that the overall process R+R' is cyclic. 
If R' crosses the adiabatic energy levels backwards, then 
at the final time t' { of the full cyclic process R + R' one 
has 

ei(* / f)=ei(i i ), e 2 (* / f )=e 3 (ii). (70) 
Together with 

|ai2(^)| 2 = |a 21 (^)| 2 = 1 - hx(4)| 2 = 1 - |a 12 (^)| 2 , 
and Eqs. 1(213112} as applied to the full cyclic process (i.e., 



D. Many-level systems. 



Let S have a finite number of levels, and assume that 
two of them cross. For quasi-adiabatic processes (r is 



To make a detailed comparison between Eqs. 1671 and H54I . sub- 
stract from H(s) (defined by 1581 ) an irrelevant factor ltr_ff(s), 
and note from Eqs. 1361 1381 that the following relations arc valid 
for the present model: 03 = = fi^i oil = t&j. Substituting them 
into Eq. J^J one gets Eq. 1571 . 




FIG 



time- scale 
2: The magnified version of Fig. The exact value of 



|ai2 (sf ) I versus the characteristic time-scale r for sj 
Sf = 1.5, and lo — 1. 



-1.5, 
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changing in them if — > t' f ), Eq. (|70|> implies: 

W c = |ai 2 (^)| 2 (pi-p 2 )[e 2 (ii)-ei(ti)] 

> = 0. (71) 

Eq. I|71|l confirms the intuitive expectation that non- 
adiabatic process are less optimal. In particular, this is 
valid if R' = R ml is exactly the same process R moved 
backwards with the same speed (mirror reflection) . Then 
W c = means that the original process R is reversible: 
According to the standard thermodynamical definition 
[USE! a process R is reversible, if after supplementing 
it with its mirror reflection process i? mr , the work done 
for the total cyclic process R + i? mr is zero. 

If R' does not induce another level crossing, i.e., if 
h\(s) and /i 2 (s) m Eq. i|36|) do return to their initial val- 
ues without simultaneously crossing zero, then the levels 
are interchanged 

£i{f l )=e 2 (t i ), e 2 (t / f )=ei(t i ), (72) 
and this time Eqs. I|26l I34fl imply 

Wc = {pi - P2)[ea(*i) - (73) 
W ; c >W^ c = (l-|a 12 (^)| 2 ) Wc>0. (74) 

It is seen that in contrast to the situation described by 
Eq. I|71|). non-adiabatic processes are more optimal if 
R + R' contains one level-crossing (or an odd number 
of them). We thus have found here a violation of the 
minimal work principle for a cyclic process. 

VII. CONCLUSION. 

This paper was devoted to one of the persistent ques- 
tions in statistical physics: whether the second law is 
valid for finite systems, and if it is not valid what are the 
possible scenarios of its violation. The proper way to an- 
swer this question is to take a formulation of the second 
law which has a clear meaning for finite systems, and to 
study it from the first principles of quantum mechanics, 
without invoking any postulate. 

Along these lines we have studied the minimal work 
principle for finite systems coupled to external sources 
of work. The principle states that the work done on 
an (initially) equilibrium system during a thermally iso- 
lated process is minimal for the smallest speed of the 
process. As compared to other formulations of the sec- 
ond law, this principle has a direct practical meaning as 
it provides a recipe for reducing energy costs of various 
processes. We gave its general proof for finite systems 
which starts from first principles of quantum mechan- 
ics and avoids the usual lore associated with the second 
law (chaos, thermodynamic limit, coarse-graining, vari- 
ous definitions of entropy). We have also shown that it 



may become limited if there are crossings of adiabatic 
energy levels: optimal processes need to be neither slow 
nor reversible. Already one crossing suffices to note vi- 
olations of the principle. If this is the case, the optimal 
process occurs at some finite, system-dependent speed. 

Level-crossing was observed, e.g., in molecular and 
chemical physics EJ- It is not a rare effect [22^ : 
if the number of externally varied parameters is larger 
then two, then for typical spectra level crossin gs a re even 
much more frequent than avoided crossings 22]. It is 
possible that the presented limits of the minimal work 
principle may serve as a test for level crossings. 

Together with the universal validity of Thomson's for- 
mulation of the second law IToL [TTL [T^ | . the limits of the 
principle imply that the very equivalence between various 
formulations of the second law may be broken for a finite 
system coupled to macroscopic sources of work: different 
formulations are based on different physical mechanisms 
and have different ranges of validity. Similar results on 
non-equivalence of various formulations of the second law 
were found in Ref. Si, where for a quantum particle 
coupled to a macroscopic thermal bath, it was shown that 
some formulations, e.g., the Clausius inequality and pos- 
itivity of the energy dispersion rate, are satisfied at suffi- 
ciently high temperatures of the bath, but can be invalid 
at low temperatures, that is, in the quantum regime. The 
physical mechanism responsible for this is the formation 
of a cloud of bath modes around the particle, well known 
in case of Kondo-cloud and polaron-cloud, but more gen- 
eral. 

There are still many issues to consider before the min- 
imal work principle and its limits can be said to be prop- 
erly understood. In particular, we need a better descrip- 
tion for the transition from finite to macroscopic systems. 
It might also be of interest to find an explicit example of 
a macroscopic system displaying limits of the principle. 
A separate problem is to study the minimal work princi- 
ple in the (semi)classical limit of finite quantum systems. 
This problem is special due to the fact that the limit of 
slow processes need not commute with the classical limit 

m 

We, however, had first to understand that limits of 
minimal work principle do exist in principle and some- 
times in practice, and this is the main message of the 
present paper. 
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APPENDIX A: LIMITS TO THE SECOND LAW. 

As was stressed in the Introduction, one of basic pur- 
poses of this paper was to understand limits of the min- 
imal work principle, which is a particular formulation of 
the second law. We therefore consider it necessary to 
clarify what in general we mean by "limits to the second 
law". 

It is first of all important that one is dealing with a 
concrete formulation of the second law, which, within the 
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studied situation, has a clear conceptual and operational 
meaning. For example, Thomson's formulation of the 
second law as applied to finite systems is certainly of 
that type, and so is the minimal work principle. Both 
these formulations of the second law operate with the 
concept of work, which — in contrast to entropy — is 
defined unambiguously for finite systems at all times. 

If within the studied situation we find the statement 
not satisfied — e.g., the minimal work principle need not 
be satisfied in the presence of level crossings — then we 
encountered a limit of that particular formulation. Note 
that for finite systems it will not be legitimate to speak 
about limits of any entropic statement of the second law, 
since entropy for a finite system does not have a clear and 
well-accepted physical meaning. One can define here var- 
ious entropies (especially in the quantum situation) and 
it is not clear with respect to which one the correspond- 
ing statements of second law are to be formulated. Here 
one encounters limits of the concept of entropy rather 
than limits of the second law. It is only in the macro- 
scopic situation that the coarse-grained entropy becomes 
a meaningful quantity on sufficiently long time-scales. 

In the context of limits of the second law there is an 
opinion that fluctuations of various quantities — e.g., 
fluctuations of work, provided they can be sensibly de- 
fined — provide violations of the second law. This opin- 
ion is supported by an observation that for a typical sys- 
tem in the thermodynamical limit the fluctuations vanish 
and, for example, Thomson's formulation in this limit is 
a statement on energy difference of a macroscopic sys- 
tem (i.e., in the definition of work one need not take 
averages over different systems or different realizations). 
According to this opinion, the second law is formulated 
as a statement on random quantities (e.g., on the random 
quantity work) , and once they can fluctuate for finite sys- 
tems, this gives violations of the second law. 

It is true that when the second law was first deduced 



in the XIX century, it was formulated for a single closed 
system, in a way resembling the laws of ordinary mechan- 
ics. However, already in the begin ning of XX century it 
was clearly understood l5ll l53j that this law has 
only a statistical character and refers to averages over an 
ensemble of identically prepared systems, rather than to 
a single system. This viewpoint became widely accepted 
when the first robust observations of fluctuations were 
made [EJ • Together with theoretical works of Boltzmann 
in kinetic theory of gases and of Smoluchowski, Fokker, 
Planck and Einstein in physics of Brownian motion, they 
formed a consistent picture of the second law as emerging 
from micro-physics through averaging over fluctuations. 
A detailed summary of this activity is presented in the 
1937 book by Epstein [5l|, while Tolman in 1938 H2 and 
Kemble in 1939 53] discuss theoretical aspects of the sit- 
uation. Since then, this understanding of the second law 
entered into several modern books of statistical physics 
and thermodynamics 0, @] . 

It is also true that when the statistical character of the 
second law was not yet widely accepted, several known 
scientists made statements on violations of the second 
law by fluctuations |54|. One of these citations is by 
Maxwell " the second law is drawn from our experience 
of bodies consisting of immense number of molecules.... 
it is continually being violated in any sufficiently small 
group of molecules. As the number ... is increased .. the 
probability of a measurable variation may be regarded as 
practically an impossibility. " If one cites this quotation, 
one should keep in mind that Thomson's formulation of 
the second law is perfectly valid for any "small group of 
molecules" . As we saw in the present paper, the same 
— modulo level-crossing — concerns the minimal work 
principle. 

In summary, fluctuations do not provide violations of 
the second law, since this law is formulated with respect 
to averages. For a recent discussion on this point see |55| . 



